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Abstract 

We study the critical behavior of inhomogeneous random graphs where edges are present 
independently but with unequal edge occupation probabilities. The edge probabilities are 
moderated by vertex weights, and are such that the degree of vertex i is close in distribution 
to a Poisson random variable with parameter Wi, where Wi denotes the weight of vertex i. We 
choose the weights such that the weight of a uniformly chosen vertex converges in distribution 
to a limiting random variable W, in which case the proportion of vertices with degree k is 
close to the probability that a Poisson random variable with random parameter W takes the 
value k. We pay special attention to the power-law case, in which ¥{W > k) is proportional 
to fe-(^-i) for some power-law exponent r > 3, a property which is then inherited by the 
asymptotic degree distribution. 

We show that the critical behavior depends sensitively on the properties of the asymp- 
totic degree distribution moderated by the asymptotic weight distribution W. Indeed, when 
¥(W > k) < ck~^'^~^^ for all /c > 1 and some r > 4 and c > 0, the largest critical connected 
component in a graph of size n is of order n^/^, as on the Erdos-Renyi random graph. When, 
instead, ¥{W > k) = ck~^'^~^\l + o(l)) for k large and some r G (3, 4) and c > 0, the largest 
critical connected component is of the much smaller order n^'^~'^^^^'^~^\ 

1 Introduction and results 

We study the critical behavior of inhomogeneous random graphs, where edges are present indepen- 
dently but with unequal edge occupation probabilities. Such inhomogeneous random graphs were 
studied in substantial detail in the seminal paper by Bollobas, Janson and Riordan [7], where var- 
ious results have been proved, including their critical value by studying the connected component 
sizes in the super- and subcritical regimes. 

In this paper, we study the critical behavior of such random graphs, and show that this crit- 
ical behavior depends sensitively on the asymptotic properties of their degree sequence, i.e., the 
asymptotic proportion of vertices with degree k for each k > 1. Our results show that the critical 
behavior of our inhomogeneous random graphs admits a transition when the third moment of the 
degrees turns from finite to infinite. 
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1.1 Inhomogeneous random graphs: the rank-1 case 

In this section, we introduce the random graph model that we shall investigate. In our models, 
w = {wj)j^[n] are vertex weights, and in is the total weight of all vertices given by in = J2^=i ^i- 
We shall mainly work with the Poisson random graph or Norros-Reittu random graph [32], which 
we denote by NR,„(tt)). In the NR„(tt;), the edge probabilities are given by 

^ (1.1) 



(NR.) 



Pi, 

More precisely, p^^^^ is the probability that edge ij is present or occupied, for 1 < i < j < n, and 
different edges are independent. In Section 1.3, we shall extend our results to graphs where the 
edge probabilities are either = max{wjWj/£„, 1} (as studied by Chung and Lu in [11, 12, 13, 14]) 
or pij = WiWj/{in + WiWj) (as studied by Britton, Deijfen and Martin-Lof in [10]). See [7, Section 
16.4] for a detailed discussion of the relation between the general inhomogeneous random graph 
and the models studied here, which are called rank-1 inhomogeneous random graphs in [7]. 

Naturally, the graph structure depends sensitively on the empirical properties of the weights, 
which we shall now introduce. Let F be a distribution function, and define 



w 



[l-Fr\j/n), (1.2) 



where [1 — F] ^ is the generalized inverse oi 1 — F defined, for u G (0, 1), by 

[1 - F]~\u) = mi{s : [1 - F]{s) < u}. (1.3) 

By convention, we set [1 — -F]~-'^(l) = 0. 

In the setting in (1.1) and (1.2), by [7, Theorem 3.13], the number of vertices with degree k, 
which we denote by A^^, satisfies, with W having distribution function F appearing in (1.2), 

Nk/n^fk = E[e-'^—\, k > 0, (1.4) 

where — ^ denotes convergence in probability. We recognize the limiting distribution {/fc}^^ as 
a so-called mixed Poisson distribution with mixing distribution F, i.e., conditionally on W = w, 
the distribution is Poisson with mean w. Since a Poisson random variable with a large parameter 
is highly concentrated around that parameter, it is intuitively clear that the number of vertices 
with degree larger than k is, for large k, quite close to n[l — F{k)]. In particular, for a > 0, 
Xlfc ^"/fc < ^ precisely when E[iy^] < oo. 

In our setting, there exists a giant component containing a positive proportion of the vertices 
precisely when z/ > 1, where we define 

-5^. (1.5) 

E[W] ^ ^ 

As we explain in more detail in Section 1.3, we shall see that u arises as the mean of the size-biased 
distribution of W, which, in turn, arises as the mean offspring in a branching process approximation 
of the exploration of the connected component of a vertex. More precisely, if > 1, then the largest 
connected component has n({l + op{l)) vertices, while if < 1, the largest connected component 
has op(n) vertices. Here we write that X„ = op{bn) for some sequence when Xn/bn converges 
to zero in probability. See, e.g., [7, Theorem 3.1 and Section 16.4] and [11, 14, 32]. When u > 1, 
the rank-1 inhomogeneous random graph is called supercritical, when = 1 it is called critical, 
and when z/ < 1, it is called subcritical. The aim of this paper is to study the size of the largest 
connected components in the critical case. 
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1.2 Results 



Before we can state our results, we introduce some notation. We write [n] = {1, . . . ,n} for the 
set of vertices. For two vertices s,t E [n], we write s < — y t when there exists a path of occupied 
edges connecting s and t. By convention, v i — y v. For v E [n], we denote the cluster of v by 
C{y) = E [n]: V < — y x}. We denote the size of C{v) by |C(t')|, and define the largest connected 
component by |Cmax| = niax{|C(t')| : v G [n]}. Our main results are: 

Theorem 1.1 (Largest critical cluster for r > 4). Fix NR„(iu) with w = {wj)j(zin] as in (1.2), 
and assume that the distribution function F in (1.2) satisfies u = 1. Suppose there exists a t > 4 
and a constant Cp > such that, for all large enough x > 0, 

1 - F{x) < CpX-^^-^l (1.6) 

Let w = {wj)j^[n] be defined by 

Wj = (1 + en)wj, (1.7) 

and fix En such that \e^\ < An~^^^ for some A > 0. Then there exists a constant b = 6(A) > such 
that for all u > 1 and for n sufficiently large, NR„(ii;) satisfies 

Theorem 1.2 (Largest critical cluster for r G (3,4)). Fix NR„(iu) with w = {wj)j(zin] as in (1.2), 
and assume that the distribution function F in (1.2) satisfies v = 1. Suppose that there exists a 
T G (3, 4) and a constant < Cp < oo such that 

lim x^-^[l -F(x)] = c^. (1.9) 

Fix En such that \£n\ < An~^'^~'^'^^^'^^^^ for some A > 0. Then there exists a constant b = 6(A) > 
such that for all u > 1 and for n sufficiently large, NR„('u;), with w defined as in (1.7), satisfies 

1.3 Discussion and related results 

In this section, we discuss our results and the relevant results in the literature. We start by 
introducing some notation used throughout this paper. We write X ~ Poi(A) to denote that X 
has a Poisson distribution with (possibly random) parameter A, and a.„ = Q{bn) if there exist 
positive constants c and C, such that, for all n, we have cbn < an <i C6„. 

Branching process approximation. The main tool used in this paper is the comparison of 
clusters to branching processes. Let be a vertex chosen uniformly from [n]. Then, the number 
of neighbors of Vn is close to Poi(iy„), where Wn = wy^ is the (random) weight of Vn- In the 
setting of (1.2), we shall see that Wn converges in distribution to a random variable W having 
distribution function F, which explains that a uniformly chosen vertex has a degree that is close to 
Poi(H^) (recall (1.4)). As described in more detail in Section 3.2, we can describe the set of vertices 
to which Vn is connected by associating a random mark to each of the Poi(VFn) values, where the 
mark equals i G [n] with probability Wijin- Then, the set of neighbors of Vn equals the set of 
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marks chosen. Further, the distribution of the degree of a neighbor of Vn is close to X ~ Poi{wM), 
where M is the mark associated to the neighbor, and the degrees of different neighbors of Vn 
are close to an i.i.d. sequence. Thus, the cluster exploration is close to a branching process with 
offspring distribution X ~ Poi{Wif). It is not hard to see that Poi(wM) converges in distribution 
to Poi(W^*), where, for a non-negative random variable X with E[X] > 0, we let X* denote its 
size-biased distribution given by 

, E\Xlsx<x}] 

Thus, Poi(iy*) has finite variance when W has a finite third moment. For details, see Proposition 
3.4, where this connection is made explicit. We denote the mean offspring of the branching process 
by 

En 2 
W ■ 

Vn = ^t'oi[WM)\ = (1-12) 

In the setting of (1.2), we shall see that z/„ — > u, where u is defined by (1.5) (see Corollary 3.2(b) 
below). Therefore, the resulting branching process is critical precisely when u = 1. Observe that 
the offspring Poi(iy*) of this branching process has finite variance when r > 4, but not when 
r G (3,4). We now make use of the relation to branching processes to connect the subcritical and 
supercritical regimes to the critical one. 



Connecting the subcritical and supercritical regimes to the critical one. We first give 
a heuristic explanation for the critical behavior of n'^^^ appearing in Theorem 1.1. Let En = 
Vn — ^ and r > 4. By the branching process approximation, the largest connected component 
has size Pnn{l + o(l)) when En > 0, where p„ is the survival probability of the branching process 
approximation to the cluster. Now, p„ is of the order e„ when r > 4, since the corresponding 
branching process has finite variance in this case. On the other hand, the largest subcritical cluster 
is 0(e~^ log (ne^)) when < 0, since, for branching processes with finite mean, the probability 
that the total progeny exceeds k is approximately equal to Q{l/\/k)e~®^'''^"\ This suggests that 
the critical behavior arises precisely when = i.e., when e„ = and in this case, the 

largest connected component is e„n = n^^^ as in Theorem 1.1. 

We next extend this heuristic to the case r G (3,4), for which the picture changes completely. 
The results by Janson in [22] suggest that the largest subcritical cluster is like Wi/{1 — v) = 
Q[n^l^'^"^) l\Sn\) when z/„ = 1 + 5„ and e„ < 0. We note that [22] only proves this when i/ < 1 is 
fixed, but we conjecture that it extends to all subcritical v. In the supercritical regime, instead, 
the largest connected component should be like npn-, where pn is the survival probability of the 
(infinite variance) branching process approximation of the cluster. A straightforward computation 
shows that, when > and = o(l), we have pn ~ £n^'^ '^^ (see Lemma 3.6 below). Thus, this 
suggests that the critical behavior should now be characterized instead by taking n}/^'^~^'> / Sn = 
el!^'^~^'^n, which is = n~'^'^~^^/'^'^~'^\ In this case, the largest critical cluster should be of the 
order Theorem 1.2. This suggests that in both cases, the subcritical 

and supercritical regimes connect up nicely. In order to make these statements precise, and thus 
showing that Theorems 1.1-1.2 really deal with all the 'critical weights', we would need to show 
that when is much larger than n~^/^ and n~^'^~'^^/^'^^^\ respectively, the above heuristic bounds 
on |Cmax|, for both the super- and subcritical regimes, are precise. 
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The scaling limit of cluster sizes for r > 4. A special case of Theorem 1.1 is the critical 
behavior for the Erdos-Renyi random graph (ERRG), where bounds as in (1.8) have a long history 
(see e.g., [16], as well as [5, 25, 30, 34] and the monographs [6, 27] for the most detailed results). 
The ERRG corresponds to taking Wj = c for all j G [n], and then in (1.5) equals c. Therefore, 
criticality corresponds to Wj = 1 for all j G [n]. For the ERRG there is a tremendous amount of 
work on the question for which values of p, similar critical behavior is observed as for the critical 
value p = 1/n [1, 5, 16, 25, 29, 30]. Indeed, when we take p = {1 + \n~^^^)/n, the largest cluster 
has size Q{n'^^^) for every fixed A G M, but it is op(n^/'^) when A — — oo, and has size ^ n^/^ when 
A 3> 1. Therefore, the values p satisfying p = (1 + \n~^^^)/n for some A G M are sometimes called 
the critical window. 

Aldous [1] proves that the vector of ordered cluster sizes of the ERRG weakly converges to 
a limiting process, which can be characterized as the excursions of a standard Brownian motion 
with a parabolic drift, ordered in their sizes. A less well-known extension by Aldous [1] can be 
found in [1, Prop. 4], where an inhomogeneous random graph is studied in which there is an edge 
between i and j with probability 1 — e"^'^'^ for some vertex weights (xj)jg[„] and different edges 
are independent. This corresponds to our setting when we take Xi = Wi/\/I^. Aldous shows in [1, 
Prop. 4] that the ordered cluster weights weakly converge to a limit closely related to that of the 
ERRG, where the weight of a set of vertices C equals ^^ec^c- Since the completion of the first 
version of this paper, in fact the weak convergence of the ordered cluster sizes has been proved 
independently and almost at the same time in [35, 4], using related means as in [1], and under the 
slightly weaker condition that E[Vr'^] < oo. We have included the proof of Theorem 1.1 as this 
proof follows the same lines as the proof of the novel result in Theorem 1.2, and the proof nicely 
elucidates the place where the restriction r > 4 is used. It is not hard to see that our proofs in 
fact carry over to the situation where E[iy^] < oo, but we refrain from doing so for simplicity. 

The scaling limit of cluster sizes for r G (3,4). When r G (3,4), large parts of the above 
discussion remain valid, however, the variance of Poi(iy*) arising in the exploration process is 
infinite. Therefore, the critical nature of the total progeny of the branching process approximation 
is rather different, which is reflected in different critical behavior. Since the completion of the flrst 
version of this paper, in fact the weak convergence of the ordered cluster sizes has been proved in 
[3]. The proof relies on the fact that the cluster exploration can be described by a thinned Levy 
process having rather interesting behavior. In the proof in [3], the results derived in this paper, in 
particular Propositions 2.4 and 2.5 below, play a crucial role. 

In our results, we have assumed the precise power-law form of 1 — F in (1.9). A heuristic 
computation shows that when -u H- [1 — F]~^{u) = u~^^^^~^H{u) for some function u H- i{u) slowly 
varying at m = 0, then the size of |Cmax| becomes n*^'^~^^/*^'^~^)/£(l/n), and the width of the critical 
window becomes n^'^~^^^^'^~^^£{l/n)'^. This also sheds light on the critical cases r = 3 and r = 4. 
Indeed, when r = 3 and u i{u) is such that E[W^^] < oo, we predict that the above applies. If 
r = 4 and u H- l{u) is such that E[Vr^] < oo, then we predict that |Cmax| is of order n?/^ as in 
Theorem 1.1, while if r = 4 and u H- l{u) is such that E[W^'^] = oo, then we predict that |Cmax| 
is of order n^/^ / l{l/n), instead. The predictions for the critical window are accordingly. In our 
proofs, the presence of a slowly varying function should enter in Propositions 2.4-2.5 and Lemma 
3.6 below. 

Asymptotic equivalence and contiguity. We now deflne two random graph models that 
are closely related to the Norros-Reittu random graph. In the generalized random graph model 
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[10], which we denote by GRG„(if;), the edge probabihty of the edge between vertices i and j 
is equal to p-^'""'' = ^ ^w'w • random graph with prescribed expected degree or Chung-Lu 

random graph [11, 12, 13, 14], which we denote by CL„(iu), the edge probabihties are given by 
= maxj^^j^, 1}. The Chung-Lu model is sometimes referred to as the random graph with 
given expected degrees, as the expected degree of vertex j is close to Wj. 

By [24, Examples 3.5 and 3.6], in our setting, the graphs NR„(iu), GRG„(iu), and CL„(it?) 
are asymptotically equivalent (meaning that all events have the same asymptotic probabilities), so 
that Theorems 1.1-1.2 apply to GRG„(ti>) and CL„(iu) as well. 



The configuration model. Given a degree sequence, namely, a sequence of n positive integers 
d = idi)i^[n] with the total degree = J2i=i assumed to be even, the configuration model 
(CM) on n vertices with degree sequence d is constructed as follows: 

Start with n vertices and dj half-edges adjacent to vertex j. Number the half-edges from 1 
to in some arbitrary order. At each step, two half-edges (not already paired) are chosen 
uniformly at random among all the unpaired half-edges and are paired to form a single edge in 
the graph. Remove the paired half-edges from the list of unpaired half-edges. Continue with this 
procedure until all half-edges are paired. 

By varying the degree sequence d, one obtains random graphs with various degree sequences in 
a similar way as how varying w influences the degree sequence in the NR„(tt)) model studied here. 
A first setting which produces a random graph with asymptotic degree sequences according to 
some distribution F arises by taking (rfi)j6[„] = (-Dj)jg[„], where (L'i)je[„] are i.i.d. random variables 
with distribution function F. An alternative choice is to take ((ii)jg[„] such that the number of 
vertices with degree k equals \nF{kY\ — \nF{k — 1)]. 

The graph generated in the construction of the CM is not necessarily simple, i.e., it can have 
self-loops and multiple edges. However, if 



jy(CM) 



J2Mdr-l) (1.13) 



converges as n — !■ oo and dj = o{y/n) for each j G [n], then the number of self-loops and multiple 
converge in distribution to independent Poisson random variables (see e.g., [23] and the references 
therein). In [31], the phase transition of the CM was investigated, and it was shown that when 
piOM) _^ j^{CM) ^ -^^ certain conditions on the degrees are satisfied, then a giant component 
exists, while if z/''^'^^ < 1, then the largest connected component has size op(?7,). In [26], some of 
the conditions were removed. Also the barely supercritical regime, where n^^^^Un " 1) oo, is 
investigated. One of the conditions in [26] is that 'Yll=i '^t^^ — 0{n) for some 77 > 0, which, in the 
power-law setting, corresponds to r > 5. Here we write that f{n) = 0{g{n)) for a non-negative 
function g{n) when there exists a constant C > such that \ f{n)\/ g{n) < C for all n > 1. 

In [26, Remark 2.5], it is conjectured that this condition is not necessary, and that, in fact, 
the results should hold when - Ym=i converges. Similar results are proved in [28] under related 
conditions. The results in [26, 28] suggest that the barely supercritical regime for the CM is similar 
to the one for the ERRG when r > 4. We strengthen this by conjecturing that Theorems 1.1-1.2 
also hold for the CM when e„ = — 1 is replaced by = t^^^^ ~ 1- After the completion of the 
first version of this paper, a result in this direction was established in [18]. Indeed, denote 

R=\Y.dl (1-14) 
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Then, under the assumption that the maximal degree of the graph satisfies A„ = maxjg[„] di < 
n^/^R^/^ / logn, and for = — 1| < An~^/^R'^^^, jCmaxI is with high probabihty in between 

uv?/^R~^/^ and n^/'^R~^/'^ /uj for large a; > 1. 

When T > 4:, R remains uniformly bounded, while for r G (3,4), R = 0(n^~'^/^^^^^) and 
An = Q{n^^^'^~^'^)- Therefore, for r E (3,4), the bound on A„ has an extra l/logn too many to 
be able to compare it to our results. The high degree vertices play a crucial role in the scaling 
limit (as shown in [3]), so that we conjecture that the scaling limit is affected by this restriction. 
We also refer to the discussion in [18, Section 1.2] for a discussion on the relations between our 
results. It would be of interest to investigate the relations further. 

It is well know that when (1.13) holds and A„ = o(i/n), the CM is asymptotically contiguous 
to a uniform random graph with the same degree sequence. Indeed, the number of self-loops and 
multiple edges converge in distribution to independent Poisson random variables, which are both 
equal to zero with positive probability. Further, the CM conditioned on not having any self-loops 
is a uniform random graph with the same degree sequence. Also the generalized random graph 
conditioned on its degrees is also a uniform random graph with that degree sequence (see e.g., 
[10]). Since the degrees of the barely supercritical regime in the rank-1 inhomogeneous random 
graph as studied here satisfy the conditions in [26], the results there also apply to our model, 
whenever r > 5. We leave further details of this argument to the reader. 

2 Strategy of the proof 

In this section, we describe the strategy of proof for Theorems 1.1-1.2. We start by discussing the 
relevant first and second moment methods in Section 2.1, and in Section 2.2, we reduce the proof 
to two key propositions. 

2.1 First and second moment methods for cluster sizes 

We denote by 



the number of vertices that are contained in connected components of size at least k. Here, we 
write 1a for the indicator of the event A. 

The random variable Z>(. will be used to prove the asymptotics of |Cmax|- This can be under- 
stood by noting that |Cmax| > k occurs precisely when Z^^. > k, which allows us to prove bounds 
on |Cmax| by investigating Z^^ for appropriately chosen values of k. This strategy has been suc- 
cessfully applied in several related settings, such as percolation on the torus in general dimension 
[9] as well as for percolation on high-dimensional tori [8, 19, 21]. This is the first time that this 
methodology is applied to an inhomogeneous setting. 

The main aim of this section is to formulate the necessary bounds on cluster tails and expected 
cluster size that ensure the asymptotics in Theorems 1.1-1.2. This will be achieved in Propositions 
2.1-2.3 below, which derive the necessary bounds for the upper and lower bounds on the maximal 
cluster size respectively. Throughout the paper, we will use the notation {x Ay) = min{x, y}, (x V 
y) = max{x,y}. 

Proposition 2.1 (An upper bound on the largest critical cluster). Fix A > 0, and suppose that 
there exist 5 > I and ai = ai(A) > such that, for all k > n^/^^+^') and for Vn a uniformly chosen 




(2.1) 



ve[n] 
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vertex in [n], the bound 

P(|C(K)| >k)< ai(A:-V^+ (5„ Vn-(^-^)/(^+^))'/^'-'^) (2.2) 

holds, where 

\er,\ < An~('~'y('+'l (2.3) 
Then, there exists a bi = &i(A) > such that, for all uj > 1, 

P(|C„,,.| > < ^. (2.4) 

u 

The bound in (2.2) can be understood as a bound on the tail of the total progeny of a branching 
process, where the first term corresponds to the total progeny being finite and larger than k, while 
the second term corresponds to the survival probability of the branching process. This will be 
made precise in the sequel. 

Proof. We use the first moment method or Markov inequality, to bound 

1 n 

P(|C„,ax| >k)= P(Z>, >k)< -E[Z>,] = -^P(|C(K)| > k), (2.5) 

where Vn. G [n] is a uniformly chosen vertex. Thus, we need to bound P(|C(Ki)| > k) for an 
appropriately chosen k = kn. We use (2.2), so that 

< ai(a;-(i+i/^) + (ni/(^+i)|£„| ^(^-i) v 1)0;^^) 

< ai(u;-(i+i/^) + (1 + Ai/(^-i))a;-i), (2.6) 

when k = kn = cun^^^^'^^^^^ = ujn^^^^~^^\ and where we have used (2.3). This completes the proof 
of Proposition 2.1, with bi = ai(2 + A^^^^'^^). □ 

Proposition 2.1 shows that to prove an upper bound on |Cmax|j it suffices to prove an upper bound 
on the cluster tails of a uniformly chosen vertex. In order to prove a matching lower bound on 
|Cmax|j we shall use the second moment method, for which we need to give a bound on the variance 
of Z>(.. To state the result, we define 

X>Ap) = n\C{Vn)mciv„)\>k}l (2.7) 

where p = {pij)i<i<:j<n denote the edge probabilities of an inhomogeneous random graph, i.e., the 
edge ij is occupied with probability pij and the occupation status of different edges are independent. 
Then the main variance estimate on Z>fe is as follows: 

Proposition 2.2 (A variance estimate for Z^f.). For any inhomogeneous random graph with edge 
probabilities p = {pij)i<i<j<n, every n and k G [n], Var(Z>fe) < nx>k{p)- 

Proof. We use the fact that 

Var(Z>,)= J2 [nm\>k,\C{j)\>k)-n\C{z)\>k)n\C{j)\>k)]. (2.8) 

j,je[n] 
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We split the probability P(|C(i)| > k, \C{j)\ > k), depending on whether i < — > j or not, i.e., we 
split 

¥{\C{t)\ > k, \C{j)\ >k)= P(|C(z)| > k, \C{j)\ >k,t^ j) 

+ F{\Cit)\>k,\Cij)\>Kt^j). (2.9) 

We can bound 

¥{\C{z)\ > k, \C{j)\ >k,z^j)< f({\C{z)\ > k} o {\C{j)\ > k}), (2.10) 



where, for two increasing events E and F, we write E o F to denote the event that E and F occur 
disjointly, i.e., that there exists a (random) set of edges K such that we can see that E occurs by 
only inspecting the edges in K and that F occurs by only inspecting the edges in K'^. Then, the 
BK-inequality [2, 17] states that 

F{EoF) <F{E)¥{F). (2.11) 
Applying this to (2.10), we obtain that 

P(|C(z)| > k, |C(j)| >k,t^j)< P(|C(z)| > kM\C{j)\ > k). (2.12) 

Therefore, 

Var(Z>,) < P(|C(2)| > A;, |C(j)| > k.i^j), (2.13) 

i,je[n] 

and we arrive at the fact that 

Var(Z>,)< n\Cm>kACU)\>k,i^3) (2.14) 

= E[]l||c(,)|>fc}]l{jecW}] = $^Ej]l||c(,)|>fc} Y ^{jecW} 

i&[n]je[n] ie[n] jG[n] 

= 5^E[|C(z)|ll{|c(.)|>fc}] =nE[|C(K)|]l{|c(y„)|>fc}] =^X>.(P)- □ 

«G[n] 

Proposition 2.3 (A lower bound on the largest critical cluster). Suppose that there exist 5 > 1 
and 02 > such that for all k < and for Vn a uniformly chosen vertex in [n], 

n\C{Vn)\>k)>^, (2.15) 

while 

E[\C{Vn)\]<n^^'^^/^^+^\ (2.16) 
then there exists an 62 > such that, for all u > 1, 

P(|C^,.| < u;"V/(^+^)) < (2.17) 

Proof. We use the second moment method or Chebychev inequality, as well as the fact that |Cmax| < 
k precisely when Z>fe = 0, to obtain that 

P(|C_| < = P(Z,^-.„v(... = O) < ^"'^"^^'"^^ (2.18) 



By (2.15), we have that 

= nP(|C(K)| > > = a,a;VV/(^+i). (2.19) 

Also, by Proposition 2.2, with k = kn = uj~^n^^^^~^^\ 

Var(Z,„-i„./(^+i)) < rix>.-i„^v(^+i)(p) < n'+^'~'^/^'^'^ = n'"^'^^\ (2.20) 
Substituting (2.18)-(2.20), we obtain, for n sufficiently large. 



25/(5+1) I 

P.(|C„„I < < = (2^2i; 



This completes the proof of Proposition 2.3. □ 
2.2 Reduction of the proof to two key propositions 

In this section, we state two key proposition and use it to complete the proof of Theorems 1.1-1.2. 
We denote 

5=(rA4)-2. (2.22) 
Our main technical results are formulated in the following two propositions: 

Proposition 2.4 (An upper bound on the cluster tail). Fix NR„(io) with w = (wj)jg[n] in 
(1.2), and assume that the distribution function F in (1.2) satisfies u = 1. Assume that (1.6) holds 
for some r > A, or that (1.9) holds for some r G (3,4), and fix En such that \en\ < Xn~^^^^^/^^^^'^ 
for some A > 0. Let w be defined as in (1.7). Then, for all k > 1 and for Vn a uniformly chosen 
vertex in [n]: 

(a) There exists a constant Oi > such that 

P(|C(K)| >k)< a^{k-^" + [e^y n-^'-^^'^'+^^Y"-'-'^). (2.23) 

(b ) There exists a constant 02 > such that 

P(|C(K)|>A:)>|^. (2.24) 

Proposition 2.5 (An upper bound on the expected cluster size). Fix A > 1 sufficiently large, and 
let En < — A?7,~(^~-'^''/(^+-^^ Then, for Vn a uniformly chosen vertex in [n], 

MC{Vn)\]<n^^-^^/^^+^\ (2.25) 

Now we are ready to prove Theorems 1.1-1.2: 

Upper bounds in Theorems 1.1—1.2. The upper bounds follow immediately from Propo- 
sitions 2.1 and 2.4(a), when we recall the definition of 5 in (2.22), so that (2.3) is the same as 
\^n\ < An~^/^ for r > 4, as assumed in Theorem 1.1, and < An"^^"^''/'-^"^-' for r G (3,4), as 
assumed in Theorem 1.2. 
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Lower bounds in Theorems 1.1—1.2. For the lower bounds, we note that there is an obvious 
monotonicity in the weights, so that the cluster for Wi as in (1.7) with e„ = — An"^''"-'^)/^''^^) 
is stochastically smaller than the one for Sn with \en\ < Xn~^^~^^/^^^^\ Then, we make use 
of Proposition 2.3, and check that its assumptions are satisfied due to Propositions 2.4(b) and 
2.5. □ 



3 Preliminaries 

In this section, we derive preliminary results needed in the proofs of Propositions 2.4 and 2.5. 
We start in Section 3.1 by analyzing sums of functions of the vertex weights, and in Section 3.2 
we describe a beautiful connection between branching processes and clusters in the Norros-Reittu 
model originating in [32]. 

3.1 The weight Wn of a uniformly chosen vertex 

In this section, we investigate the weight of a uniformly chosen vertex in [n], which we denote by 
Wn- For this, we first note that 

[1 - F]-^{1 -u) = F-\u) = mi{x : F{x) > u}, (3.1) 

which, in particular, implies that = [1 —F]~^{U) has distribution function F when U is uniform 
on (0, 1). Further, Wn is a random variable with distribution function F„ given by 



n-l 

''{[l-F]-i(l-^)<x} 



Fn{x) = F{Wn < ^) = ^ E a{^.<.} = ^ E hl~F]-Hi)<.} = ^ E ^{ 

j=l j=l i=0 

^ n— 1 ^ n— 1 ^ 

= - E V-M-)<^} = - E h-<Fi^)} = - ( L^^(^)J + 1) 1' (3-2) 



Jl C ^ L \r,'- ' fl ^ ^ L„_ V /J ^ 

i=0 1=0 



where we write j = n — i in the fourth equality and use (3.1) in the fifth equality. Note that 
Fn{x) > F{x), which shows that Wn is stochastically dominated by W, so that, in particular, for 
increasing functions x i— )■ h{x), 

1 " 

-E^K) < E[/i(Vr)]. (3.3) 

" 3=1 

In the sequel, we shall repeatedly bound expectations of functions of Wn using the following lemma: 

Lemma 3.1 (Expectations of Wn). Let W have distribution function F and let Wn have distribu- 
tion function Fn in (3.2). Let h: [0, oo) C be a differentiable function with h{0) = such that 
\h'{x)\[l — F{x)] is integrable on [0,oo). Then, 

POO 1 rwi 

mh{Wn)]-^h{W)]\ < \h'{x)\[l- F{x)]dx + - \h'ix)\dx. (3.4) 

Jwi Jo 

Proof. We write, using the fact that h{0) = and that |/i'(x)|[l — F{x)] is integrable, for any 
5 > 0, 

/■oo rB 

E[hiW)l{w<B}]^ / h'ix)l{,^w<B}dx = / h\x)[FiB)-Fix)]dx. (3.5) 
'-Jo ^ Jo 
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Because of this representation, we have that 

POO 

E[h{Wn)]-E[h{W)]= h'{x)[F{x) - Fn{x)]dx. (3.6) 

Jo 

Now, Fn{wi) = 1 by construction (recall (3.2)), so that 

/•OO /"if 1 

E[h{Wn)]-E[h{W)] < / \h'{x)\[l- F{x)]dx+ / \h'{x)\[Fn{x) - F{x)]dx. (3.7) 



We finally use the fact that < -F„(x) — F{x) < 1/n to arrive at the claim. □ 

Corollary 3.2 (Bounds on characteristic function and mean degrees). Let W and Wn have dis- 
tribution functions F and F^, respectively, and assume that (1.6) holds for some r > 3. 

(a) Let 

'^-^^^ = E[wg ' ^^^^ - ~^W] — ■ ^ ^ ^ 

Then, 

(h) With V as in (1.5), z/„ m(1.12) and with Vn = (1 + en)i^n, 

We remark that if (1.6) holds for some r > 4 or (1.9) holds for some r G (3,4), then also (1.6) 
holds for that r > 3. This explains the assumption that r > 3 in Corollary 3.2. 

Proof, (a) We first take = 0, and split 

u^) - m - - ^) + E[^(^['^-"'^'="-"i - Ei'^^""'"-'!)- (3.11) 

Lemma 3.1 applied to h{x) = x yields |E[1V„] — E[W^]| < cn~^'^~'^^^^'^~^\ To apply Lemma 3.1 to 
h{x) = a;e^*^°"~^\ we compute 

\h\x)\ = |e^(*="-i) +i(e^*- l)xe**(^-^)| < l + x|t|. (3.12) 

Therefore, also using the fact that wi = e(n^/(^-i)) by (1.6), 

POO -| PWl 

E[lKe^"('='*^i)] - E[iye^(^'*"^)] </ {1 + x\t\)[l - F{x)]dx + - il + x\t\)dx 

Jwi ^ Jo 

< cn-(--2)/(--i) + c|t|n-(^-^)/(^-i). (3.13) 
Together, these two estimates prove the claim for £.„ = 0. For 5„ 7^ 0, we use the fact that 
E[iy,e(i+^")^"(^"-i)] E[H^,e'^"(°"-i)] _ E[W„e^"(^"-i)(e^"^"('="-i) - 1)] 

E[wg E[wg ~ E[wg 

= 0{\en\\t\E[W^]/E[Wr.]) = 0(|5„||t|). (3.14) 

(b) The proof of (3.10) is similar. □ 
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Lemma 3.3 (Bounds on moments of Wn). Let Wn have distribution function Fn in (3.2). 

(i) Assume that (1.6) holds for some r > 3, and let a < r — 1. Then, for x sufficiently large, 
there exists a C = C{a,T) such that, uniformly in n, 

E[l^„^ V.>-}] < Cx''^'-^. (3.15) 

(a) Assume that (1.9) holds for some t > 3, and let a > t — 1. Then, there exist Ci = Ci(a, r) 
and C2 = 6*2(0., t) such that, uniformly in n, 

C,{xAn'/^^~''>y^'-^ <E[W:i^w„<.}]<C2X^^'-^. (3.16) 
Proof, (i) When a < r — 1, the expectation is finite. We rewrite, using (3.5), 

POO 

E[W:i{w„>x}] = x'^il - Fn]{x) + a ^"-^[1 - F4v)]dv. (3.17) 

J X 

Now, 1 — Fn{x) < 1 — F{x), so that we may replace the Fn by F in an upper bound. When (1.6) 
holds for some r > 3, we can further bound this as 

POO 

E[W^l[w>x}] < CfX^^^^" + cpa / w^-^dw = Oix"^^-^). (3.18) 

J X 

(ii) We again use (3.5) and the bound in (1.9) to rewrite 

PX PX PX 

E[W^n V.<-}] = « / v''-^[Fn{x)-Fn{v)]dv<a v^^^il- F {v)]dv < c^a v''-^dv = 0{x''+^~^). 
Jo Jo Jo 

For the lower bound, we first assume that x < n^/^"^"^^ and use the fact that 



ex 



V.<-}] = « / ^""'{Fnix) - Fn{v)]dv > a / v'^-'lF^ix) - Fn{v)]dv 
Jo Jo 

pex pex 

= a v''-^[l-Fn{v)]dv-a'^[l-Fnix)] v°-^dv 
Jo Jo 

pex pex pex 

>a v''^^[l-F{v)]dv-- v^'-^dv -a[l- F{x)] v'^-^dv 
Jo Jo Jo 

> C{exy+^-^ - C{exYln - Cx-^''-^\exY 

= Ce^+'-^x^+^-^ll - - e"^^) > C2a;"+^-^ (3.19) 

when we take e G (0, 1) sufficiently small, and we use the fact that x < n^^^'^~^\ When x > n^^^'^~^\ 
we can use the fact that Wi = Q(n^/^'^~^^), so that 

E[W:i^w„<x}] > wt/n > (cni/(^-i))7n = C2{n'/^^-'^y-''~\ (3.20) 

□ 

3.2 Connection to mixed Poisson branching processes 

In this section, we discuss the relation between our Poisson random graph and mixed Poisson 
branching processes due to Norros and Reittu [32]. 

13 



Stochastic domination of clusters by a branching process. We shall dominate the cluster 
of a vertex in the Norros-Reittu model by the total progeny of an appropriate branching process. 
In order to describe this relation, we consider the cluster exploration of a uniformly chosen vertex 
Vn E [n]. For this, we define the mark distribution to be the random variable M with distribution 

P(M = m) = w^/£n, m e [n]. (3.21) 

We define So = I, and, recursively, for i > 1, 

Si = Si., + Xi - 1, (3.22) 

where (Xj)j>i is a sequence of independent random variables, where Xi has a mixed Poisson 
distribution with random parameter WM^, and where Mi is uniformly chosen in [n], while (Mj)j>2 
are i.i.d. random marks with distribution (3.21). Let 

T''> = inf{t > 1 : 5t = 0} (3.23) 

denote the first hitting time of of {Si)i>o. By exploring a branching process tree, we see that T'^' 
has the same distribution as the total progeny of a so-called two-stage mixed Poisson branching 
process, in which the root has Xi ~ Poi{wv„) children where Vn is chosen uniformly in [n], and 
all other individuals have offspring distribution given by Poi(wA/)- In the sequel, we shall use the 
notation T*^' for the total progeny of a two-stage mixed Poisson branching process with offspring 
distribution Xi ~ Poi^WmJ and the root has offspring distribution Xi ~ Poi(wv„). We shall use 
the notation T for the total progeny of a mixed Poisson branching process where every individual, 
including the root, has an i.i.d. offspring distribution Xi ~ Poi(wA/J, where (Mj)j>i is an i.i.d. 
sequence of marks with distribution described in (3.21). 
Clearly, wv„ has distribution Wn defined in (3.2), while 

n ^ n 

<x) = Y, iw<x-}P(M = m) = -Y, wmi{n.„^<.} = nw: < x), (3.24) 

m=l " m=l 

where W* is the size-biased distribution of Wn (recall (1.11)). In order to define the cluster 
exploration in NR„(if;), we define Xi = Xi and, for i >2, 

Xi = Xil{M,^{Ah,-,Mi-i}}^ (3.25) 

and define Si and T as in (3.22) and (3.23), where Xi is replaced by Xi. The definition in (3.25) 
can be thought of as a thinning of the branching process. The mark Mj corresponds to the vertex 
label corresponding to the i^^ individual encountered in the exploration process. If we have already 
explored this vertex, then we should ignore it and all of its subsequent offspring, while if it is a 
new vertex, then we should keep it and explore its offspring. Thus, the definition in (3.25) can 
be thought of ensuring that we only explore the offspring of a vertex once. We think of Xi as the 
potential new vertices of the cluster C{Vn) neighboring the i^^ explored vertex. A potential vertex 
turns into a real new element of CiVn) when its mark or vertex label is one that we have not yet 
seen. 

We will now make the connection between the thinned marked mixed Poisson branching process 
and the cluster exploration precise; this relation (in a slightly different context) was first proved in 
[32, Proposition 3.1]. Since the proof of [32, Proposition 3.1] is in terms of the number of vertices 
at distance k of the root, we reprove this result here in the setting of the cluster exploration: 
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Proposition 3.4 (Clusters as thinned marked branching processes). The cluster C(V^) of a uni- 
formly chosen vertex in [n] is equal in distribution to {Mi, . . . ,Mf}, i.e., the marks encountered 
in the thinned marked mixed Poisson branching process up to the end of the exploration T . 

Proof. We start by proving that the direct neighbors of the root agree in both constructions. We 
note that Xi = Xi, which has a mixed Poisson distribution with mixing distribution Wv,^-, which 
has the same distribution as Wn- 

Conditionally on = Z, Xi has a Voi{wi) distribution. These Poi(wj) offspring receive i.i.d. 
marks. As a result, the random vector (Xij)jg[,„], where Xij is the number of offspring of the root 
that receive mark j, is a vector of independent Poisson random variables with parameters wiWk/ in- 
Due to the thinning, a mark occurs precisely when Xij > 1, and these events are independent. 
Therefore, the mark j occurs, independently for all j G [n], with probability 1 — e^i'^k/f-n _ p'-!^^\ 
which proves that the set of neighbors of the root is equal in distribution to the marks found in 
our branching process. 

Next, we look at the number of new vertices of C{Vn) neighboring the i^^ explored potential 
vertex. First, conditionally on Mj = /, and assume that / ^ {Mi, . . . ,Mj„i}, so that X^ = Xi. 
Conditionally on Mj = j such that j ^ {Mi, . . . ,Mj_i}, X^ has a Poi{wj) distribution. Each of 
these Poi(wj) potential new vertices in C{Vn) receives an i.i.d. mark. Let Xij denote the number of 
potential new vertices of C{Vn) that receive mark j. Then, again is a vector of independent 

Poisson random variables with parameters wiWk/in- Due to the thinning, a mark occurs precisely 
when Xij > 1, and these events are independent. In particular, for each j ^ {Mi, . . . ,Mj}, the 
probability that the mark j occurs equals 1 — e'^j'^k/^n _ p^.^^^, as required. □ 

Proposition 3.4 implies that, for all /c > 1, 

P(|C(K)| > A;) < P(f > A;) < P(T'^' > k). (3.26) 

Interestingly, when the weights equal Wi = nlog(l — X/n) for each i E [n], for which the graph 
is an Erdos-Renyi random graph with edge probability X/n, the above implies that |C(Ki)| is 
stochastically dominated by the total progeny of a Poisson branching process with parameter 
nlog(l — X/n). Often, the stochastic domination is by a branching process with binomial n — 1 
and p = X/n offspring distribution instead. 

Otter-Dwass formula for the branching process total progeny. Our proofs make crucial 
use of the Otter-Dwass formula, which describes the distribution of the total progeny of a branching 
process (see [15] for the special case when the branching process starts with a single individual 
and [33] for the more general case, and [20] for a simple proof based on induction). 

Lemma 3.5 (Otter-Dwass formula). Let (Xj)j>i be i.i.d. random variables. Let denote the 
Galton- Watson process measure with offspring distribution Xi started from m initial individuals, 
and denote its total progeny by T. Then, for all k,m eN, 

k 

Fm{T = k) = JF{J2x^ = k-m). (3.27) 

i=l 

The survival probability of near-critical mixed Poisson branching processes. We shall 
also need bounds on the survival probability of near-critical mixed Poisson branching processes. 
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Lemma 3.6 (Survival probability of near-critical mixed Poisson branching processes.). Let pn be 

the survival probability of a branching process with a Poi(W*) offspring distribution. Assume that 
En = IE[py^] — l = z/„ — 1>0 and En = 0(1). When (1.6) holds for some r > 4, there exists a 
constant c > such that 

Pn < CEn- (3.28) 

When (1.9) holds for some r G (3,4), 

Pn<c(4/(^-3Wn-i/(^-i)). (3.29) 

Proof. By conditioning on the first generation, the survival probability p of a branching process 
with offspring distribution X satisfies 

l-p = E[(l-p)^]. (3.30) 
In our case, X ~ Poi(W^„*), for which E[s^"''-^"^] = E[e^"^'-^^], so that 

1 -p„ = E[e-''"^"]. (3.31) 
We use the fact that e~^ > 1 — x when x > 1/2 and e~^ > 1— x + x^/4 when x < 1/2, to arrive at 

1 - > 1 - Pnm:] + ^mW:)%,^^w^<y2}] . (3.32) 

Rearranging terms, dividing by p„ and using that IE[1V^] = i^n, we obtain 

PnE[{W:)%p„w*<i/2}] < 4(m„ - 1) = 4£„. (3.33) 
When (1.6) holds for some r > 4, 

K{w:)%,„w*<i/2}] = mwrKi + = + ^(i)), (3.34) 

so that pn < CEn ioT somc constant c > 0. When, on the other hand, (1.9) holds for some r G (3, 4), 
then when p„ < 2n~^^^'^~^'^ the claimed bound holds. When, instead, pn > 2n~^^^'^~^\ we may 
apply Lemma 3.3(ii) to obtain 

Hy^n) Hp„Wi<l/2}\ - > CPn , (3.35) 

SO that cpn~^ < 4:En- Combining these two bounds proves (3.29). □ 



4 An upper bound on the cluster tail 

In this section, we shall prove the upper bound on the tail probabilities of critical clusters in 
Proposition 2.4(a). 
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Dominating the two-stage branching process by an ordinary branching process. We 

rely on (3.26). Unfortunately, the Otter-Dwass formula (Lemma 3.5) is not directly valid for T^^\ 
and we first establish that, for every k > 0, 

F(T('^ >k)< P(T > k). (4.1) 

The bound in (4.1) is equivalent to the fact that T'^' ^ T, where X ^ Y means that X is 
stochastically smaller than Y. Since the distributions of T'^' and T agree except for the offspring 
of the root, where T'^' has offspring distribution Poi{Wn), whereas T has offspring distribution 
Poi(PV^), this follows when Poi(VF„) ^ Poi{W*), For two mixed Poisson random variables X,Y 
with mixing random variables Wx and Wy, respectively, X ^ Y follows when Wx ^ Wy The 
proof of (4.1) is completed by noting that, for any non-negative random variable W, and for W* 
its size-biased version, we have W ^ W*. 

The total progeny of our mixed Poisson branching process. By (3.26) and (4.1), 

oo 

P(|C(K)| >k)< P(T > A;) = P(T = cx)) + P(A; < T < cx) = P(T = oo) + J^P(T = I) 

l=k 

oo I 

= P(T = oo) + ^ -P( J] X, = / - 1), (4.2) 

l=k i=l 

where the last formula follows from Lemma 3.5 for m = 1, and where {Xi)°Zj^ is an i.i.d. sequence 
with a Poi(iy^) distribution. In the following proposition, we shall investigate P(X]i=i = ^ ~ !)• 

Proposition 4.1 (Upper bound on probability mass function of Yl\=i ^i)- (-^j)i^i be an i.i.d. 
sequence with a mixed Poisson distribution with mixing random variable W* = (1 + en)W*, where 
W* is defined in (3.24). Under the assumptions of Proposition 2.4(a), there exists an di > such 
that for all I > n^/^^+^^ and for n sufficiently large, 

p(^Xi = / - 1) < ~a,{l''/' + A 1)/-^/'), (4.3) 

i=l 

where S > is defined in (2.22). 

Proof. We rewrite, using the Fourier inversion theorem, and recalling 0n(t) in (3.8), which we can 
identify as 0„(t) = Efe'*^^] = Efe^^""^)^-], 



.(t)f^, (4.4) 



SO that 



it)\f^. (4.5) 



P(^X, = /-l) < f |0„ 

By dominated convergence and the weak convergence of W* to W*, for every t G [— tt, tt], 

lim (j)n{t) = (f){t) = E[e^*(^"-i)]. (4.6) 
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Since, further, 

(t) I = |E[iy:e'*e(^"-^)^"] I < nw:] = (1 + = 1 + o(l), (4.7) 

which is uniformly bounded, the convergence in (4.6) is uniform for all t G [— 7r,7r]. Further, for 
every ?7 > 0, there exists e > such that |0(t)| < 1 — 2e for all \t\ > rj, since our mixed Poisson 
random variable is not degenerate at 0. Therefore, uniformly for sufficiently large n, for every 
r] > 0, there exists e > such that |0n.(^)| < 1 — e for all \t\ > rj. Thus, 



/ \Ut)\'^<il-ey+ [ \Mt)\'^- (4.8) 



We start by deriving the bound when r > 4, by bounding 

\Mt)\ <E[e-^"[i-^°^(*)l]. (4.9) 

Now using the fact that, uniformly for t G [— vr, vr], there exists an a > such that 

l-cos(t)>at^ (4.10) 

and, for a; < 1, the bound e~^' < 1 — x/2, we arrive at 

|0n(t)| <E[e"'^^"*^] <E[(l-aW:tV2)]l^,^.,.<,j] +E[l^,^,,.^,^] 

= 1 - at'E[W:] + E[]l^„^.,.>,j(l + aW:ty2)] . (4.11) 

Further bounding, using Lemma 3.3 and r > 4, 

E[%w*t^>i}(l + aW:ty2)] < lam[\aW,t^^,}W:] = oif), (4.12) 

we finally obtain that, uniformly for t G [—77,77], there exists a 6 > such that |0n(^)| < 1 — bt"^- 
Thus, there exists a constant 02 > such that 

/ \Mt)\'l^ <il-ey+ [ (1 - bt^yl^ < ^, (4.13) 

which proves (4.3) for 6 = 2 and r > 4. 

In order to prove (4.3) for r G (3,4), for which 5 = r — 2 < 2, we have to obtain a sharper 
upper bound on |0„(t)|. For this, we identify (pnit) = Re(0„(t)) + ilm(0„(t)), where 

Re(0„(t)) = E[cos(lV'„* sin(t))e-^"[i-'=°^W]], lm(0„(t)) = E[sin(iy„* sin(t))e-^"I^-=°^Wl] , 

(4.14) 

so that 

|0„(t)P = Re(0„(t))2 + Im(0„(t))2. (4.15) 
We start by upper bounding |Im(</)„(t))|, by using that | sin(t)| < |t| for all t G M, so that 

iim(0.(t))i < \t\nw:] = m + o(i)). (4.16) 

Further, 

Re(0„(t)) = 1 - E[l - cos(WC sin(t))] + E[cos(iy: sin(t))[e-^"[^-^°^(*)l - 1]] . (4.17) 
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By the uniform convergence in (4.6) and the fact that, for 77 > small enough, Re(0(t)) > 0, we 
only need to derive an upper bound on Re(0„(t)) rather than on |Re(0„(t))|. For this, we use the 
fact that 1 — < x and 1 — cos(t) < to bound 

|E[cos(IK sin(t))[e-^"[^-=°^(*)l - 1]] | < E[1 - e-^-Ii— < [1 - cos(t)]E[iy„*] < i>„tV2. 

(4.18) 

Further, using (4.10) whenever W*\t\ < 1, so that also W*|sin(t)| < W*\t\ < 1, and 1 - 
cos(VF^ sin(t)) > otherwise, we obtain 

Re(0„(t)) < 1 - asin(t)2E[(l^:)^ J + z>.tV2 = 1 - ^t'^^^^^^^^ + ujy2. (4.19) 

By Lemma 3.3, we have that 

E[W^l{w.\t\<i}] > Vn-i/(^"i))^-'. (4.20) 

Combining (4.18) with (4.20), we obtain that, uniformly in |t| < rj for some small enough f] > 0, 

which, combined with (4.15) and (4.16), shows that, for |t| < 77 and 77 > sufficiently small, 

, w M fe"""'^*'"" for \t\ > 72^1/(^-1), 

Thus, there exists a constant cii > such that 



< (1 - e)' + / e-'-^-l'I'^^/^^t + / g-;a.,i^n(-^)/(^-)/2^^ 

,n] 27r Jl~V,v] J[-n-i/(T-i)_„-i/(r-i)] 

< ^ + "-^^^ = ai (/-/^ + n(^-)/^(^-)/-V^) , (4.23) 

which proves (4.3) for r G (3, 4) and with 5 = r — 2. □ 
Proof of Proposition 2.4(a)- By (4.2) and Lemma 3.6, 

P(|C(K)| > A:) < c(4/(^-^' V n-V(^-i)) + 5, J] ^ + /"^/^ 

< c{en V + 1^ + (n(--4)/2(-i) a l)k-'/'. (4.24) 

The proof is completed by noting that, for k > n^/^^+^') = 77,('^-2)/(t-i)^ 

^(r-4)/2(r-l)^-l/2 < ^(r-4)/2(r-l) ^(r-2)/2(r-l) _ ^-l/(r-l)_ (4^25) 

Thus, the last term in (4.24) can be incorporated into the first term, for the appropriate choice of 
ai. This proves the claim in (2.23). □ 
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An upper bound on the expected cluster size: Proof of Proposition 2.5. We now slightly 
extend the above computation to prove a bound on the expected cluster size. We pick A > so 
large that t'n = (1 + < 1 — n~^^~^'>^^^^^\ which is possible since i^n — 1 < cn~^'^~^^^^'^~'^^ < 

cn-(^-i)/(^+i) by Corollary 3.2(b) and (2.22). Then, by (3.26) and (4.1), as required 

E[|C(K)|] < E[T(''] < E[T] = -^-^ < r2('5-i)/('^+i). (4.26) 



5 A lower bound on the cluster tail 

In this section, we prove a lower bound on the cluster tail. The key ingredient in the proof of 
Proposition 2.4(b) is again the coupling to branching processes. Note the explicit coupling between 
the cluster size |C(V^)| and the total progeny T<^' described there. We can then bound 

P(|C(K)| >k)> P(T''> > 2k, \C{Vn)\ >k)= P(T''> > 2k) - P(T'') > 2k, \C{Vn)\ < k). (5.1) 

The following lemmas contain bounds on both contributions: 

Lemma 5.1 (Lower bound tail total progeny). Under the assumptions of Proposition 2.4(b), there 
exists a constant 02 > such that, for all k < en^^~^^^^^'^^\ 

nT'^' >k)>'^8- (5-2) 

Lemma 5.2 (Upper bound cluster tail coupling). Fix e > sufficiently small. Under the assump- 
tions of Proposition 2.4(b), there exists constants c,q > such that, for all k < en^^~^''^^^^^\ 

nT^^^>2k,\C{V^)\<k)<^. (5.3) 

Proof of Proposition 2.4(b) subject to Lemmas 5.1-5.2. Recall (5.1), and substitute the bounds in 
Lemmas 5.1-5.2 to conclude that 

WV;)l>^)>^-|;;>^. (5.4) 

when e > is so small that 2^"^/'' 02 ~ ce'^ > 0'2- This is possible, since 5 > 1. □ 
Proof of Lemma 5.1. We start by noting that 

P(T<'^ >k)> P(T''> >k,Xi = l)=W{T>k- l)P(Xi = 1) > P(T > k)'¥{Xi = 1). (5.5) 

Note that P(Xi = 1) = E[W„e-'^"] = E[lVe-'^] + o(l), which remains strictly positive. Thus, it 
suffices to prove a lower bound on P(T > k). For this, we bound 

00 00 ^ I 2k ^ I 

P(T > fc) > $^P(T = /) = 5^ yP($^X, = / - 1) > 5^ -P( ^X, = / - 1). (5.6) 

l=k l=k i=l l=k i=l 

We prove the bounds for r G (3,4) and r > 4 simultaneously, the latter being somewhat 
simpler. We shall follow a large part of the analysis for the upper bound in Proposition 4.1. Recall 
(4.4), to get 



P(^X, = /-1) = / Re(e^*(e-"0.(t))')f . 



(5.7) 
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It is not hard to see that, by the arguments in the proof of Proposition 4.1 (in particular, recall 
(4.22)), 



(e"-l)(e-'Vn(t))' 



dt 
2^ 



and, with K, = Kl-^/\ 



^7T,7t]\[-Ki,K,] 



27r 



(5.8) 



(5.9) 



so we are left to prove a lower bound of the form c/"^/^ for Re((e-'*0n(t)) )£• Note that, 

by the uniform convergence of e~'*0„(t) to e~'*</)(t), we have that e~'*0„(t) — > 1 uniformly for 

te[-Ki,Ki]. 

Let (pnit) = e~^^(t)n{t),ip{t) = e~'*0(t). By scaling. 



/ Re(v.„,(t)0^ = rV^/ Re(^„(trV^)')|^. 



We rewrite 



K] 



SO that 

Re(<^„(t)') = Re(^ip{ty^Re(l + 
When t 0, with a = (r A 4) - 2, 



(5.10) 



(5.11) 



(5.12) 



(5.13) 



since W* — 1 is in the domain of attraction of an a-stable distribution when (1.6) or (1.9) hold, 
and thus also X = Poi(H^*) — 1 is. Thus, pointwise in t as Z — >■ oo. 



')') = e-^l*l"(l + 0(1)), Im((p(t/-i/^)') = 0(1). 



Re((^(t/-i/(^-2)V 
Therefore, also using the fact that 

which is integrable, dominated convergence gives that, for every K > 0, 



-K,K] 



Imfy?(t/-^/^)')lmfl 



(5.14) 



(5.15) 



(5.16) 



By Corollary 3.2(a) and the fact that \(^{t)\ > 1/2 for all \t\ < Ki 



Re 1 



) > (1 - 2|(/^„(t) - ip{t)\y > 1 - 2i\ip^{t) - ipit)\ 

> 1 - 2/cn"(-"2)/(^-i) + 2cl\t\ (|5„| + 

> 1 - 2c(/n-(--2)/(-i)) - 2c|t|/i/(^-2)(/n-(^-2)/(--i))(-3)/(-2). (5.17) 
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Now we use the fact that I < 2k < 2en''/('^+i) , where S/{S + 1) = (r - 2)/(r - 1) for r G (3, 4) and 
S/(S + 1) = 2/3 < (r - 2)/(r - 1) for r > 4, while < An-(^^i)/(^+i), where (5 - l)/(5 + 1) = 
(r - 3)/(r - 1) for r G (3, 4) and {6 - l)/{6 + 1) = 1/3 < (r - 3)/(r - 1) for r > 4. Therefore, for 
r > 4, the left-hand side of (5.17) is 1 + o(l), while, for r G (3,4) we can bound it as 

Refl + V^n.(t) - V^(t) y > ^ _ 4^ _ 4^^(.-3)/(.-2)(i ^ A)|t|/V(-2) 

> 1 - 4ce - 4c£(""^)/("~2) (1 + A)K > 1/2, (5. 18) 

when 4ce(l + A) + 4c£:(^-2)/(^-2)jr < 1/2. In particular, for all \t\ < Ki, the left-hand side of (5.18) 
is non-negative. Therefore, we arrive at the claim that, for e > sufficiently small, there exists 
C = C{e,\) > such that, as / — )■ oo, 

/ Re{^n{tr^")')^>C f e-^l*l"dt + o(l). (5.19) 

J[-K,K] J\-K,K] 

This completes the proof of Lemma 5.1. □ 

Proof of Lemma 5.2. In the proof, we will make repeated use of the following lemma: 

Lemma 5.3 (Upper bound on tail probabilities of random sums). Suppose that (Ti)i>i are i.i.d. 
random variables for which there exist constants K > {] and 5 > 1 such that 

HTi >k)< Kk^^'^, (5.20) 

and let M he independent from (Ti)i>i. Then, there exists a constant cs > such that 

M 



J^T; > A;) < csKE[M]k~^/\ (5.21) 



1=1 

Proof. We split, depending on whether there exists a j G [M] such that Tj > k ot not, 

M M 

P(5^Tz > A;) < P(3j G [M] : T, > k) + P(5^T;%<fc} > k^ 
1=1 1=1 

< E[M]P(Ti >k) + -E[j2Tam<k} 

1=1 

1 ^ 

< KE[M]k^^/^ + -E[M] Vp(Ti > /) 

1=1 

< KE[M] {k~^'^ + 5Z ^^^'^) - csKE[M]k~^'\ (5.22) 

1=1 

where in the second inequality, we use Boole's inequality for the first term and Markov's inequality 
for the second. □ 
We write 

k-l 

P(T'^' > 2k, \C{Vn)\ <k) = ^P(r(^' > 2k, \C{Vn)\ = t). (5.23) 



t=i 
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For j G [n], let Mj{t) denote the number of times the mark j is drawn in the first t draws, and 
define Nf = {j G [n]: Mj{t) > 2}. For j G [n], let (Tj,,)^i be an i.i.d. sequence of random 
variables where Tj^i is the total progeny of a branching process which has a Poi(tyj) offspring in 
the first generation, and offspring distribution VoiiW*) in all later generations. When = t, 

but T'^' > 2k > t, then we must have that the total progeny of the thinned vertices is at least k. 
A vertex is thinned precisely when its mark is chosen at least twice, so that we can bound 

P(T(^) > 2k, \C{Vn)\ =t)< p(t(^) > 2k, \C{Vn)\ =t,J2 Tj,t > k^, (5.24) 
where, since all repetitions of vertex j after the first are thinned, 

M,(t)-1 
s=l 

Since t ^ Nt and t i— ?■ Mj(t) are non- decreasing, we arrive at 

P(T''> > 2k, \C{Vn)\ <k) < f(t^'^ > 2k, J2 Tj,k > k^ (5.26) 

Now, as in the proof of Lemma 5.3, we split the event 'Ylij^N,, ^ k into the event where all 
Tj^k < k and the event where there exists a, j E such that Tj ^ > k, to arrive at 

n 

P(T<^) > 2k, \C{Vn)\ <k)< ^P(r<^' > 2k, J G iVfc,f,-fc > k)+nT''' > 2k, ^ T^.a^f, ,.<fc} > k) 

< ^ [P(T'^^ > 2k, J G iVfc,T,-fc > k) + -^P(T(^) > 2k, J G iV,,T,-fc > /) 
j=i 1=1 

the last bound by the Markov inequality. 

Let {Ki,K2) be the first two times before T for which M^^ = = j- Then, noting that 
T'^' > ki needs to occur when Ki = ki, 

P(T(^' > 2k, J G iVfc, T;- > /) < ^(^'" ^ ^1' (^1' ^2) = (fci, ^2)) 

l<ki<k2<k 

X p(t,,, > / I {IU,K2) = {ki,k2)). (5.27) 

We wish to apply Lemma 5.3 to Tj^ in (5.25), where M is Mj{k) — 1 conditioned on {Ki,K2) = 
{ki, k2). When k < en^^-^^l^^~^\ 

nM,{k) - 1 I (Ki, K2) = {ki, k2)] = 2 + {k-k2)^<2 + = 0(1). (5.28) 

In order to apply Lemma 5.3, we proceed by checking (5.20). Note that {Tj^s)s>i is i.i.d. with 

p. 

Tj^i = 1 + "^i^iTi, where Pj ~ Poi(w;j) and where (T/)i>i is an i.i.d. sequence of total progenies 
of branching processes with offspring distribution Poi(W^). Thus, by Lemma 5.3, for which the 
assumption follows from Proposition 2.4(a) (using the fact that < 0), we have 

Pj 

P(T,-i > /) = P(^T, > / - 1) < csa^E[P^]r'/' = (5.29) 
1=1 
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Therefore, applying Lemma 5.3 yields that 



p(r(^' > 2k,j e iv,,f;-, > /) < ^ ^i'(^i'^2) = {h,h))^^^. (5.30) 



l<fcl<fe2<fc 



11/5 



The factor Wj appearing in (5.30) can be harmful when Wj is large. To resolve this problem, for j 
such that Wj > k^^^, we simply bound the restrictions on Tj ^ away, so that 

F{T^'^ >2k,\C{Vn)\ < k) (5.31) 

n 

^ 2 5^ E '^{^,<k-w^AT^'' > ku {Ku K,) = {k„ k,)) 

j=l l<ki<k2<k 

j=l l<ki<k2<k 1=1 

Performing the sum over / and bounding F{{Ki, K2) = {ki, ^2)) < {'^jl^nf' leads to 



j=l l<ki<k2<k 



j=l l<ki<k2<k 
^J.2-l/S 



-^1/5 



(5.32) 



When r > 4, we have that 5 = 2 and k < en'^^^, so that we can bound the above by 

1.2-2/5 u U3/2 

P(T(^' > 2k, \C(Vn)\ <k) < Cs EUW*?] = C5- = Cs A;"'/' < Cse'/\~^/^, (5.33) 

^ ^ n n n 

for some constant Cs > 0, so that (5.3) follows with q = 3/2. 

When r G (3,4), we use Lemma 3.3, now with k < en^'^^'^^^^'^"^^ and 5 = r — 2, to obtain 



U2-2/5 

P(T''' > 2k, |C(K)| < A;) < Cs E 



n 



1.2-1/5 

Cs- E 



n 



<Cs 



^(r-l)/(r-2) 



n 



for some constant Cs > 0, so that (5.3) follows with g = (r — l)/(r — 2) > L 



□ 
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